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ABSTRACT 

We  investigate  the  effects  of  asymmetry  on  estimates  of  variance  of  robust 
estimator  in  location  and  regression  problems,  showing  that  heavy  skewness  of 
errors  can  seriously  bias  the  common  variance  estimates  for  location  and 
intercept,  a problem  that  can  be  corrected  by  jackknifing  for  location  but  is 

more  intractable  for  the  intercept  in  regression.  The  scale  parameters  in 
regression  seem  not  to  be  as  seriously  subject  to  this  bias  if  the  sample  size 
is  large  compared  to  the  number  of  parameters. 
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Introduction 

The  theoretical  results  and  Monte-Carlo  studies  in  the  area  of  robustness 
have  in  the  main  focused  on  symmetric  distributions  (Andrews,  et  al  (1972))  or 
procedures  which  are  not  scale  equivar iant(which  effectively  eliminates  most 
problems  due  to  asymmetry  when  the  number  of  dimensions  in  the  problem  is 
fixed).  Recently,  Huber  (1973)  and  Bickel  (1978)  have  examined  situations  in 
which  the  asymmetry  of  errors  can  lead  to  quite  complicated  results.  In  this 
paper  we  study  the  effects  of  asymmetric  errors  in  two  very  simple  situations: 

(one  and  two  dimensional)  location  problems  and  simple  linear  regression.  We 
have  focused  our  attention  on  estimating  the  variability  of  robust  point 
estimators  in  these  problems. 

A major  difficulty  with  considering  asymmetric  errors  has  been  that  location 
(intercept)  is  not  uniquely  defined.  However,  asymmetric  data  do  occur  and  there 
are  situations  where  data  transformations  to  achieve  symmetry  either  make  no 
sense  or  are  not  possible.  In  regression,  it  might  be  conjectured  that  asym- 
metry has  different  effects  on  intercept  and  slope  (see  Section  3);  if  so, 
there  will  be  situations  where  one  might  invest  much  effort  in  data  transforma- 
tions, when  the  parameters  of  interest  are  not  influenced  by  the  asymmetry. 

This  alone  seems  a good  reason  for  studying  asymmetry,  but  as  another  example, 
in  ranking  and  selection  problems,  one  is  often  interested  in  the  stochastically 
largest  population;  robust  estimators  have  been  proposed  in  place  of  the  sample 
mean  for  this  problem,  and  a data  transformation  which  achieves  symmetry  for  all 
populations  may  not  be  possible. 

In  Section  2,  we  study  the  effects  of  asymmetry  of  errors  for  the 
one-dimensional  location  problem,  using  M-estimates,  trimmed  means,  and  an 
adaptive  trimmed  mean  due  to  Hogg  (1974).  In  Section  3,  we  study  M-estimates  of 
regression.  Our  major  qualitative  conclusions  are  as  follows: 

__  : 

. — — A 


(1)  The  published  estimates  of  variance  for  M-est mates  of  location  will 
be  consistently  small  if  the  errors  are  heavily  skewed.  This  problem  appears 
amenable  to  solution  if  a variance  estimate  suggested  by  jackknifing  is  used. 

(2)  In  linear  regression,  a similar  conclusion  holds  true  for  the  inter- 
cept. Theoretical  and  Monte-Carlo  results  show  that  slope  parameters  are 
influenced  only  negligibly  by  skewness.  Jackknifing  for  the  intercept  does  not 
work  as  well  here,  with  the  variance  estimates  now  being  slightly  too  large. 

(3)  In  the  two-sample  problem  with  equal  scales,  a robust  test  statistic 
can  be  constructed  with  a consistent  variance  estimtate,  even  if  the  errors  are 
asymmetric.  This  statistic,  however,  has  unknown  sensitivity  to  heteroscedasti- 
city  of  variances. 

Location  Estimates 


In  this  section  we  define  the  location  estimates  used  in  the  study,  state 
two  Lemmas  and  give  a discussion  of  expected  results.  An  M-estimator  is  defined 
as  a solution  to  the  equation 


(2.1) 


£ <K(X.  - Tn)/ 


Sn) 


0, 


where  sn  is  an  estimate  of  scale.  In  the  Monte-Carlo  study,  we  used  two  func- 
tions defined  by 


(2.2) 


i|i(x)  = -ipC- x)  = max(-2,  min(x,2)) 


(2.3) 


i^x)  = -\}>(-x) 

= x 0<x<2.25 

= 2.25  2.25  < x _<  3.75 

= 2.25  (15-x)/11.25  3.75  x < 15 

=0  x > 15  . 


The  function  (2.2)  is  referred  to  as  Huber's  while  (2.3)  is  Hampel's  \p.  The 
solutions  to  (2.1)  were  generated  iteratively  (Gross  (1977)),  starting  with  the 


sample  median  and  with  two  estimates  of  scale 
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1 ’ J sin  " (median  absolute  residual)/. 6745 

(2.5)  s2n(new)  “((n-1)  1 £ «T((X.  - Tn)/s2n(old))}s;n(old)/a  , 

where  a * «T(Z),  the  expectation  taken  under  the  standard  normal  distribu- 

tion Junction.  Two  estimates  of  the  variance  of  n3  Tn  were  used,  namely 


(2.6) 


(2.7) 


Dln(i)  " 1,2  \n  (n-1)_1  J ^aXj-Tn)/sin)/bi  * where 

-1  ? -1 
b = n 1 l K»’((X  -T  )/s  ),  H x i ♦ n A (1-b  )/b. 

i j=j  j n in  l 


i l 


D (i)  = l.OB(n-l)"1  l w. . (Y . -T  ),  where 
-n  J n 

w..  - <K(X.-T  )/s.  )/ ( (X . -T  ) / s . ) . 
ij  j n"  in  j n”  in 


We  used  the  factor  1.08  in  0,^(1)  so  that  the  two  estimates  have  approximately 
the  same  value  for  normal  samples  in  our  Monte-Carlo  experiment,  and  we  used  the 
factor  H"  in  DJn(i)  as  suggested  by  Huber  (1975). 

The  variance  estimates  0 (i)  and  D0  (i)  are  essentially  as  suggested  by 

Gross  (1977),  the  first  motivated  by  the  asymptotic  variance  formula  for  n*2  T 

/ (x)dF (x)/ { / (x)dF (x)  , 


appropriate  when  F is  symmetric,  the  second  suggested  by  the  weighted  least 
squares  nature  of  the  iterative  estimation  procedure.  In  the  general  case  with 
possible  asymmetry,  the  behavior  of  M-estimates  T^  can  be  summarized  by  the 
following  result,  for  which  exact  conditions  can  be  generated  from  the  method 
of  proof  of  Theorem  1 of  Carroll  (1978a). 


* 
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Lemma  1 . Suppose  that  for  some  constants  I\(F),  o^(F)  we  have  that  * T.(F) 
n'J(s.n  - o.(F))  - Cyi),  and  E ^((Xj  - T.(F))/o. (F) ) « 0.  Taking  T.(F)  - 0, 
o.(F)  ■ l (without  loss  of  generality),  for  ^ smooth. 


l n i 

C.8)  = n'1  l *(X.)  ♦ (1  - s.n)Ep  X^'CXj)  ♦ 0 (n‘l) 


Discussion  of  Lemma  _l 

ihe  following  qualitative  considerations  emerge  from  Lemma  1 and  are 
verified  in  the  Monte-Carlo  experiment. 

(I)  If  the  distribution  function  F is  not  symmetric,  then,  in  general, 

F.pX^ v|» * (X, ) ^ 0 so  that  Djn(i)  (which  is  asymptotically  correct  in  the  symmetric 
case)  will  typically  underestimate  the  true  variance.  The  same  should  hold  for 

D:n(i)- 

(J)  The  bias  in  L> ^ n ( i ) and  D2n(i)  should  be  small  for  distributions  which 
are  nearly  symmetric  because  Tn  is  a smooth  function  of  the  data.  The  bias  will 
become  large  as  the  degree  of  asymmetry  increases. 

(5)  When  S2n  is  used  as  scale,  Tn  should  be  a particularly  smooth  function 
of  F.  From  Jaeckel  (1972),  this  means  that  jackknifing  will  probably  be  effec- 
tive in  estimating  the  variance  of  T„.  As  pointed  out  by  Huber  (1977,  p.26), 
such  a variance  estimate  will  be  more  appropriate  for  Tn  than  for  the  jackknifed 

version  of  T . 

n 

We  also  consider  trimmed  means  and  two  adaptive  versions.  Define  U^(a) (I.  (a) ) 
as  the  mean  of  the  largest  (smallest)  no  order  statistics.  As  a measure  of  tail 
length,  when  not  is  an  integer  (true  in  the  situations  considered  here),  Hogg  (1974) 
proposed 

Qn  = (Un(.20)  - Ln(.20))/(Un(.50)  - Ln(.50)). 


Then,  if  ml.  refers  to  a ml.  symmetrically  trimmed  mean,  the  Hogg  adaptive  estimate 
is  defined  by 


5% 

if 

u 

1.81 

10% 

if 

1.81  < Q < 

1.87 

25% 

if 

Q > 

1.87  . 
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The  estimate  of  an  ml  trimmed  mean  is  given  by  Shorack  (1974),  Huber  (1977, 

eq.(10.4))  has  shown  that  this  estimate  essentially  arises  from  jackknifing  the 

2 

m\  trimmed  mean.  Calling  this  variance  a (m%) , we  define 

D(Hogg)  = a“(m\)  if  HG1  = m%. 

We  also  consider  an  estimate  suggested  by  Switzer  (1970),  defined  by 

2 

Switzer  = m%  if  o“(m%)  is  minimum  among 

o2(S%),  a2(10%)  and  o2(25%). 

If  one  assumes  that  n * (Q^  - Q(F))  has  a limit  distribution,  it  is  easy  to 
obtain  the  following  disturbing  result,  precise  conditions  for  which  could  be 
given  but  are  omitted. 

L 

Lemma  2.  Suppose  Q(F)  = 1.81.  If  F is  symmetric  about  0(F),  then  n ' (HG 1 - 0(F)) 
has  a non-normal  limit  distribution.  If  F is  not  symmetric,  there  typically 
exists  no  finite  0 for  which  n a(HGl  - 0)  has  a limit  distribution. 


Sketch  of  proof.  Let  Tn(m)  be  the  m%  symmetrically  trimmed  mean  with 

L 

n (Tn(m)  - 0(m))  having  a normal  limit  distribution.  If  F is  symmetric  about 

u 

0(F),  since  is  even  and  T^fm)  is  odd,  n (Tn(m)  - 0(F))  is  asymptotically 
independent  of  n a(Qn  - Q(F)).  Define 

Aln  * ( Qn  1Q(P)},  A2n  = {Qn  > Q(F) } Bj  = {n^HGl  - 0(F))  z }. 

An  n ► «>, 
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IHBj)  * P(Aln  n Bj)  ♦ P(A2n  n Bj) 

- P(A]n  and  nV„(5)  - 6(F))  < z) 

♦ P(A2n  and  n*(T  (10)  - 0(F))  < z) 

a *aCPCn^CTnCS)  - 6(F))  <z)  ♦ P(nVn(10)  - 0(F))  < z)>, 

verifying  the  first  part  of  Lemma  2.  If  F is  asymmetric  and  0(5)  ^ 0(10) 

(the  usual  case),  for  any  0 , 

(2.9)  P (Bj ) S P(A]n  and  n'^T^S)  - 0(5))  < z + n^(0  - 0(5))) 

♦ P (A-,n  and  n^*(T  (10)  - 6(10))  j<  z + n!i(0  - 0(10))). 

Clearly,  no  0 can  be  chosen  so  that  the  right  hand  side  of  (2.9)  is  a probability 
distribution.  □ 

Because  of  Lemma  2 we  expect  HG1  to  do  very  poorly  in  terms  of  efficiency 
and  estimating  variance  if  F is  asymmetric  and  Q(F)  1.81.  For  the  negative 
exponential  distribution,  Q ( F)  ~ 1.805,  so  particular  problems  might  be  expected 
here . 

The  location  estimates  used  in  the  study  we  report  here  are  given  in  Table 
1.  These  represent  a portion  of  the  results  in  a larger  study  which  leads  to 
the  same  conclusions. 

The  Monte-Carlo  study  used  a shuffled  congruential  random  number  generator 
to  obtain  the  uniform  random  deviates.  The  Box-Muller  algorithm  was  used  to 
obtain  the  standard  normal  deviates.  Due  to  time  and  financial  considerations, 
various  sample  sizes  (500  < N < 2000)  were  used,  so  that  we  also  report  standard 


errors. 


TABLE  1 


A description  of  the  location  estimates  used  in  the  study 


Code 

M 

10% 

MCI 

Switzer 
Hub  11 
Hub  22 
Hamp  12 
Hamp  21 
Hub  1J 
Hamp  1J 
Hub  2J 


Sample  Mean 

10%  symmetrically  trimmed  mean 
See  text 
See  text 

Huber  with  s , and  D ,(1) 
nl  nl 

Huber  with  s ..  and  D .,(2) 
n2  n2 

Hampel  with  s , and  D _(1) 

1 nl  n2 

Hampel  with  s^,  and  (2) 

Huber  jackknife  variance  estimate  with  s, 
Hanpel  jackknife  variance  estimate  with  s 
Huber  jackknife  variance  estimate  with  s 


Let  2 be  a standard  normal  random  variable.  While  a wide  range  of  sampling 
distributions  were  investigated  (both  symmetric  and  asymmetric),  the  five 
presented  here  are  representative. 


‘Vp0 

Code 

2 

N ( 0 , 1 ) 

2 ♦ . 102“ 

. ION2 

2 ♦ .502“ 

. 50N2 

Negative  Exponential,  mean  1.25 

NF. 

.50  Exp (2) 

EXP (2) 

The  second  (.IDS')  is  only  slightly  skewed  and  was  chosen  from  a larger  set 
(2  ♦ . 052“ , 2 ♦ .252“,  EXP(.IOZ),  FXP(.252))  as  a reasonable  representative  of 
the  class  of  distributions  close  to,  but  not  exactly,  symmetric.  Such  data 
might  arise  for  example  from  data  transformations  which  only  achieve  approx- 
imate symmetry. 

The  sample  sice  is  n = 20.  If  N is  the  number  of  Monte-Carlo  experiments 
and  Yj,Y,,...,Y  the  realized  value  of  a location  estimator,  the  (standardized) 
Monte-Carlo  variance  is 


o 


n 

N 


The  average  value  of  a variance  estimate  of  n a ^ is  denoted  by  o“.  Table  2 

•>  t 

presents  the  values  ot'  ami  o"  and  their  standard  errors. 
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TABLE  2 

2 

The  first  row  for  each  estimator  gives  the  Monte-Carlo  variance  o of  the 
location  estimate  and  the  Monte-Carlo  average  of  the  appropriate  variance 
estimate.  The  second  row  consists  of  standard  errors. 


N (0 

,D 

.10N 

2 

.5  ON 

I2 

NE 

EXP(Z) 

2 

~2 

2 

-2 

2 

-2 

2 

-2 

2 

-2 

Code 

a 

0 

0 

a 

0 

0 

0 

0 

o 

a 

M 

1.03 

.99 

1.07 

1.01 

1.50 

1.47 

1.58 

1.55 

1.10  1 

.10 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.05 

.04 

10% 

1.09 

1.06 

1.10 

1.07 

1.24 

1.25 

1.31 

1.36 

.51 

.52 

.03 

.01 

.03 

.01 

.04 

.02 

.04 

.02 

.02 

.01 

Switzer 

1.14 

.90 

1.15 

.90 

1.57 

.93 

1.57 

1.04 

.52 

.34 

.04 

.01 

.04 

.01 

.05 

.01 

.05 

.02 

.01 

.01 

HG1 

1.09 

.96 

1.12 

.97 

1.75 

1.05 

1.63 

1.16 

.58 

.38 

.03 

.01 

.04 

.01 

.05 

.02 

.05 

.02 

.01 

.01 

Hub  11 

1.05 

1.01 

1.06 

.99 

1.49 

1.08 

1.50 

1.20 

.68 

.46 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.03 

.01 

Hub  22 

1.05 

1.02 

1.07 

1.02 

1.54 

1.21 

1.52 

1.31 

.68 

.56 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.03 

.01 

Hamp  12 

1.04 

1.06 

1.06 

1.07 

1.64 

1.24 

1.54 

1.38 

.77 

.53 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.03 

.01 

(lamp  21 

1.05 

1.01 

1.07 

1.00 

1.60 

1.14 

1.54 

1.28 

.74 

.45 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.03 

.01 

Hub  1J 

1.05 

1.01 

1.06 

1.02 

1.49 

1.49 

1.50 

1.62 

.68 

.70 

.03 

.01 

.04 

.01 

.05 

.03 

.05 

.03 

.03 

.02 

Hamp  1J 

1.04 

1.01 

1.06 

1.02 

1.64 

1.53 

1.54 

1.57 

.77 

.81 

.03 

.01 

.03 

.01 

.05 

.02 

.05 

.02 

.03 

.02 

Hub  2J 

1.05 

1.03 

1.07 

l .05 

1.54 

1 .61 

1.52 

1.70 

.68 

.75 

.03 

.01 

.03 

.01 

.05 

.05 

.05 

s 

.05 

.03 

. 03 
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Discussion  of  the  results  for  location  estimates 


The  tabled  results  confirm  our  earlier  speculations  and  might  be  summarized 
as  follows: 

(1)  It  is  difficult  to  estimate  the  variance  of  either  HG1  or  SWITZKR. 
Jackknifing  will  probably  not  help  here  as  the  estimates  are  not  smooth 
functionals  of  the  data. 

(2)  The  two  commonly  used  scales  Is.  ,s,  ) lead  to  similar  results. 

In  2n 

(5)  The  two  variance  estimates  D„j(i)  and  l>n?(i)  do  quite  well  for  N(l),l) 
and  . 10N-,  which  are  symmetric  or  nearly  so,  but  are  not  to  be  trusted  for 
heavily  skewed  data,  hither  a jackknife  or  t ransformt ions  appear  necessary. 

14)  The  jackknifed  variance  estimates  are  a dramatic  improvement  on 
l'njii)  and  There  is  no  simple  pattern  to  these  figures,  but  generally 

the  jackknifed  variance  estimates  appear  to  become  more  conservative  as  we  pass 
from  NlO,  l ) to  liXl’(Z) . 

The  optimistic  interpretation  given  here  to  the  use  of  the  jackknife  for 

variance  estimation  in  robustness  of  location  contrasts  with  that  of  Braun 

(1975),  who  concluded  that  jackknifing  and  robust  estimation  do  not  get  along 

very  well,  and  that  variance  estimates  obtained  from  jackknifing  are  not 

reliable,  fhe  closest  he  came  to  considering  our  M-estimates  was  a one-step 

M-estimate  starting  at  the  median  and  the  median  absolute  deviations  from  the 

median  IMAP,  essentially  our  s.  ).  We  believe  that  our  results  are  more 

favorable  to  the  jackknife  because  the  statistics  we  have  considered  are  much 

smoother  functionals  of  the  data  then  are  one-step  M-estimates  starting  at 

(median,  MAP),  (a  conjecture  of  this  type  has  been  made  by  Gross  (1976)), 

2 ^2 

because  instead  of  comparing  o and  o as  we  do  here,  he  compared  (in  the 

' 1 1 J 

main)  log(o*)  to  log  o~,  which  is  more  unstable  when  o >»■  1 , and  because  asymmetry 

is  unfavorable  to  (2.o)  and  (2.7).  We  used  the  one-step  M-estimate  with  tj1 


given  by  (2.2)  and  starting  at  (median,  MAP/. 6745)  and  obtained  the  table 
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•> 

a 

O from  (2.6) 
n 

o“  from  jackknife 


z 

.10Z2 

,50Z2 

NE 

EXP(Z) 

1.06 

1.08 

1,64 

1 .55 

.58 

.99 

.97 

.85 

1.04 

.33 

1.06 

1 .09 

1.90 

1.76 

.65 

Thus,  for  one-step  M-estimates,  the  jackknife  is  more  unstable  than  it  is  with 
smoother  M-estimates. 


Regression 

We  next  investigate  regression  to  see  if  similar  phenomena  to  those  found 
in  the  previous  section  continue.  Our  intuition  says  that,  under  asymmetry, 
the  common  estimates  of  variance  will  be  inconsistent  for  intercept  but  accep- 
table for  slope.  In  this  section  we  sketch  a proof  confirming  this  conjecture 
and  illustrate  small  sample  results  with  a Monte-Carlo  experiment. 

The  model  we  consider  is  (the  use  of  will  become  clear  later) 


(3.1)  Yi  = ^0  + ei  °0  (i  - 1 n)  , 

where  the  t.  are  i.i.d.  random  variables  with  lii)/(Cj)  = 0 and 
— i = ^ xil  xip) • Wc  consider  a version  of  Huber's  Proposal  2 (Huber 
(1977),  p.37),  which  involves  solving  the  equations 

i n 

(*•2)  n l «K(Y.  - x.  )/sn ) = 0 

(3.3)  (n  - p)'1  l ^2((Y  - x.  jU/s)  a. 

i«l  1 _1  -n  n 

While  we  will  assume  the  x.^  are  constants,  the  first  two  conditions  of  Lemma  3 
(to  follow)  arc  reasonable  and  may  he  justified  by  quoting  results  of  Maronna 


and  Yohai  (1978).  In  a subset  of  their  paper,  they  assume  (y-,,x7),. 

is  a sample  from  a distribution  function  P with  0^,  solving 

I:iK(y  - x 30)/o0)  = 0 
EiT((y  - X 30)/oq)  ° a . 

Defining  = (y^  - x^  3q)/Oq,  they  show  essentially  that  if  the  are  inde- 
pendent of  and  if  0^,  s^  satisfy  (3.2)  and  (3.3)  (the  latter  with  (n  - p) 
replaced  by  n) , then  n2(0n  - 0^)  and  nfys  - aQ)  are  asymptotically  normally 
distributed. 

The  proof  of  our  result  involves  Taylor  expansions  along  the  lines  of 
Carroll  (1978a,  1978b)  and  is  omitted  to  avoid  cluttering  up  the  paper  with 
messy  calculations.  Recall  that  our  x^  are  non-stochastic. 

Lemma  3 . Suppose  that 


"X  • V 

■ °p(1) 

n'(s  - o ) 

« 0 (1) 

n 0 

P 

n ” 1 X 1 X -*■  V 

(positive  definite) 

n 

n l x; 

> (1,0,0, ...,0)  = w 

i-1 

Then,  for  i|>  sufficiently  smooth. 


(3.4)  (a,V  - (a1aY/'a2)w'w)  (0^  - 0Q)/ao 

_ i 2 

= n l {x.  ' »/>(g  . ) - (a.  / a^)w  ' (^‘  (c, ) - a)} 
i=l  1 1 1 - 1 
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where 


aj  = E K^'CCj) 


a3  *=  E^»  (e1 ) 


a,  = 2E  Ej  (cl)  a4  = 2E  iKEjW  (Cj) 


□ 


In  simple  linear  regression  with 


V « 


1 0 
0 v 


we  obtain  that  if 


($.  ,,  3 , ) and  0' 

mt’  slope  ^-n 


(0.  ,,  S . ),  then 

mt  slope 


Corollary.  Under  the  conditions  of  Lemma  3,  for  simple  linear  regression 


(3-5)  <Sint  - 


(a3  * (a1a4/a2))’1n'1  l {^(e.)  - (a2/a2)  (^(e. ) - a) } ♦ 0 ( n" 1 ) , 


2 , 


-1, 


i=l 

,-l  -1 


(3-6)  (3slope  - Pslope)/o0  = (a3vlJ_1  xil  *(ci)  + V""1* 


□ 


Similar  results  hold  for  the  general  regression  problem. 


Discussion  of  Lemma  3.  The  following  conclusions  emerge  from  Lemma  3 (for 
large  samples)  and  are  partially  confirmed  for  small  samples  in  the  Monte-Carlo 
experiment  to  follow. 

(1)  It  can  be  shown  that  the  representations  (2.8)  and  (3,5)  are  equiva- 
lent when  scale  is  estimated  by  s^, 

(2)  The  representation  (3.6)  implies  that  in  large  samples,  even  with 
asymmetry 

(3’7)  Pslope  - Pslope  * Normal  (0>  o0E<,2(ri)/a3v1)  * 
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which  is  precisely  the  result  obtained  by  Huber  (1973)  for  known  scale  and  p 
f i xed . 

(3)  Thus,  for  fixed  p as  n becomes  large,  the  asymptotic  variance 
pmH.'iiii  for  1’  obtained  in  the  synmetric  case  with  known  scale  are  correct  in 
general  except  when  applied  to  the  intercept.  We  expect  to  experience  the  same 
difficulty  in  estimating  the  variance  of  the  intercept  that  we  found  in  the 
previous  section. 

(4)  For  estimating  the  variance  of  the  intercept,  the  equation  (3.0) 
shows  that  lope  *s  icient ly  smooth  for  jackknifing. 

We  constructed  a Monte-Carlo  experiment  for  n = 20  in  the  simple  linear 
regression  model 


1 * ^ x|  ♦ ci* 


where  the  values  of  x are  -.95,  - .90, . . . , .90, .95  and  the  error  distributions 

were  as  in  the  previous  section.  There  were  N=2000  iterations  for  least 

squares  and  the  Huber  estimate,  and  N=1200  iterations  for  the  jackknife.  In 

~2  2 

Table  2 we  present  the  values  of  o^/o  for  $s^0pe  and  using  least  squares, 

the  Huber  Proposal  2 with  i(>(x)  = max(-2,  min(x.2))  and  the  jackknifed  variance 
estimate  for  the  Huber  Proposal  2.  The  estimate  of  variance  used  is  essentially 
l>ln(2),  and  if  r.  = (y.  - x4  J^)/»2n* 


L>ln(2)  = lT  (n-p)  l i|T(r .)/b~,  where 
j = l J 


b“  = n'1  l \l>’  (r.),  H * 1 ♦ p(l-b)/(bn). 
j = l J 


In  Fable  3 we  present  the  average  values  ft  . and  ft.  , for  both  least 
1 p slope  int 

squares  anil  Proposal  2.  We  conclude 


15 


(1)  The  slope  estimate  Bslope  *s  hardly  influenced  by  even  large  dosages 
of  asymmetry.  It  appears  relatively  unbiased  and  its  variance  can  be  assessed 
relatively  accurately.  In  separate  Monte-Carlo  experiments,  we  have  found  that 
this  phenomena  extends  to  a quadratic  regression  with  uniform  design  (n=30)  and 
to  the  poison-treatment  3x4  design  of  Box  and  Cox  (1964). 

(2)  The  intercept  estimate  Hint  influenced  by  asymmetry  and  standard 
variance  formulas  will  tend  to  underestimate  the  true  variance  (in  asymmetric 
cases)  even  more  than  will  happen  in  least  squares. 

(3)  The  jackknife  appears  to  be  conservative,  more  so  than  in  the  location 
case.  Hinkley  (1977)  shows  that  the  jackknife  as  employed  here  (the  so-called 
balanced  jackknife)  leads  in  general  to  a biased  variance  estimate  even  in  least 
squares,  unless  the  design  is  balanced.  In  his  example,  the  design  is  much  less 
balanced  than  ours  and  the  behavior  of  the  balanced  jackknife  much  worse. 

Further  study  of  his  "weighted"  jackknife  appears  necessary,  although  for  this 
particular  design,  our  unreported  simulation  results  for  the  weighted  jackknife 
are  virtually  identical  to  those  given  here  for  the  usual  jackknife. 


(4)  Consider  a two-sample  location  problem 
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Uj 


e. 


ij 


(j  = 1.2 


= 1, . . . ,n  ) . 


The  results  of  the  previous  section  show  that  a statistic  formed  by  analogy 
with  the  t-statistic  (X  replaced  by  Tn(Xlt . . . ,Xn) ,etc.)  will  tend  to  have  a 
higher  type  I error  than  advertised.  However,  define 
(nj  ♦ n2)$0  = njpj  ♦ n2lJ2»  (nl  * n2)  = n2(Pi  - U2)»  and 


il 


x.  = 1 
i 


r 1 , . . . ,n 


1 


Zi  * Vn,,,  *i  ■ -Vn2  1 ■ "l  * 1 "2  ' 

Then  we  have  the  linear  model  and,  by  estimating  scale  simultaneously  through 
(3.2)  and  (3.3),  Lemma  3 and  Corollary  1 tell  vis  that  we  can  consistently 
estimate  the  variance  of  the  estimate  of  pj  - P2  and  hence  obtain  proper  tests. 
This  of  course  leads  to  the  usual  t-statistic  if  i^(x)  = x. 


1<> 


TABLE  2 

The  entries  arc  the  ratio  nn/°  for  the  appropriate  estimates  and  sampling 


situat ions . 

Est imat ion 
Method 

N(0,I) 

Distribution 

•ION"  . SON' 

N£ 

EXP (ZJ 

®int 

P . 
slope 

ft  ft 

int  1 slope 

Pint 

P . 
slope 

*int 

O 

1 s 1 ope 

0.  „ B , 

int  slope 

Least  Squares 

.98 

.98 

.96  .98 

.98 

.99 

.99 

1.00 

1.00  1.00 

Huber 

.95 

.98 

.93  .98 

.75 

.96 

.83 

.99 

.69  .94 

Jackknifed 

Huber 

1.02 

1 .06 

. . 

1 .07 

1.11 

1.11  1.14 

TABLE  3 

The  entries  are  the  average  values  of  the  parameters  over  the  sampling  situations 

The  true  values  are  0.  ,=  1.00,  6 . = .50 

int  slope 

N(0, 1)  .ION2  .50N2 NE EXP(Z) 


Binf  Least  Sciuares  1-00  1-05  1.00  1.00  1.00 

^slope,  Least  sRuar*s  -4y  .49  .48  .48  .48 


J 
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